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II. 2 

Composition of Binary Quadratic Forms 
and the Foundations of Mathematics 


Harold M. Edwards 


Writing to Leopold Rronecker on June 14, 1846, Ernst Kummer said of his newly 
created theory of ideal prime factors: 

Dirichlet strongly urged me to work the theory out completely and submit it to Crelle 
for publication as soon as possible. He also told me and showed me, from oral and 
written indications of Gauss himself, that when Gauss was completing the section 
of Disqu. arith. on composition of forms he had something similar to ideal factors 
for his private use, but that he had not put it on firm ground. Specifically, Gauss 
says in a note to his treatise on the factorization of integral rational functions into 
linear factors something like: “Had I been willing to use imaginaries in the way that 
earlier mathematicians did, I would have been able to simplify substantially one of 
my researches which, as it is, is quite difficult.” Gauss later told Dirichlet that the 
reference here was to the composition of forms. 1 

William Waterhouse has convincingly argued that Gauss was referring in the 
footnote Kummer mentions not to the composition of forms in sec. 5 but to the 
unfinished sec. 8 of Disquisitiones Arithmeticae. 2 One should not, however, allow 

1. [Kummer 1846/1910]: Dirichlet hat mich sehr ermahnt die Theorie baldfertig auszuar- 
beiten und Crelle zum Drucke zu iibergeben. Auch hat er mir erzahlt und gezeigt, ndmlich 
aus mtindlichen und schriftlichen Aeufierungen von Gauss, dafi Gauss schon bei Anfer- 
tigung des Abschnittes de compositione formarum aus den Disqu. arith. etwas ahnliches 
wie ideale Factoren zu seinem Privatgebrauche gehabt hat, dafi er dieselben aber nicht 
auf sicheren Grund zuriickgefuhrt hat, er sagt ndmlich in einer Note seiner Abhandlung 
tiber die Zerfdllung der ganzen rat. Functionen in linedre Factoren ohngefahr so: ,, Wenn 
ich hdtte auf dieselbe Weise verfahren wollen wie die friiheren Mathematiker mit dem 
imagindren, so wiirde eine andere meiner Untersuchungen die sehr schwierig ist sich auf 
sehr leichte Weise haben machen lassen. “ Dafi hier die compositio formarum gemeint 
ist, hat Dirichlet spater miindlich von Gauss erfahren. 

2. See [Waterhouse 1984], The eighth section is discussed by G. Frei in chap. II.4 of the 
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II. Key Mathematical Techniques of the D.A. 


this important correction to cancel the remaining, more interesting part of Kummer’s 
assertion. Although one of the three men, Gauss or Dirichlet or Kummer, appears to 
have misremembered or misunderstood what had occasioned a footnote published 
47 years earlier, they all seem to have thought in 1 846 that Gauss used “something 
similar to ideal prime factors’’ for his own calculations of compositions of forms 
when he was composing the Disquisitiones, but that he had not put it on “firm 
ground.” Consideration of such a possibility raises an interesting question about 
the Disquisitiones: What was Gauss’s conception of “firm ground” in 1801, and - 
regardless of what he might have left out - what firm ground underlay the theories 
that he did include? 

There are no statements about the foundations of mathematics in the Disquisi- 
tiones. A glimpse of Gauss’s views appears in his statement in the preface that all of 
mathematical analysis is the study of general properties and relations of numerical 3 
quantities, whereas number theory (arithmetic) studies just whole numbers. This 
attitude implies that he thought of mathematics as being founded on the notion of 
“number,” but he seems never to have discussed, in the Disquisitiones or elsewhere, 
his conception of “numbers.” 4 In sec. 7 he certainly computes with irrational num- 
bers - for example, the values of the trigonometric functions for arguments of the 
form iTtp/q with integral p and q in art. 336 - but he gives no explanation of them. 
He does not even justify his use of them in a book on arithmetic other than to say 
that “the exposition will make abundantly clear that this subject is linked to higher 
arithmetic in an intimate connection.” 5 I infer from these few remarks that Gauss’s 
view of mathematics was that it deals with computations with numbers, and that, like 
many other mathematicians since, his interest lay in pursuing mathematics itself, not 
in investigating its metaphysical underpinnings in the notion of number. 

1. The Composition of Forms in the Disquisitiones 

The difficult theory of composition of forms in sec. 5 is indeed closely related 
to Kummer’s ideal prime factors, so it is not surprising that Kummer, Dirichlet 
and Gauss would have discussed connections between the two. Kummer explicitly 
mentioned binary quadratic forms in his first paper on ideal prime factors, [Kummer 
1 847], saying that the theory of numbers of the form x+y\ZT) leads to a theory of ideal 
factors, and that the natural way of partitioning these ideal factors into equivalence 
classes corresponds exactly to Gauss’s way of partitioning binary quadratic forms into 
equivalence classes. He saw this as a powerful validation of his theory because the 


present book [Editors’ note], 

3. Gauss does not refer specifically to numerical quantities, but I am told that Maser’s use 
of this term in his 1889 German translation correctly describes the way in which Gauss’s 
contemporaries would have understood his phrase. 

4. A small note by Gauss on his conception of magnitudes, “Zur Metaphysik der Mathe- 
matik,” published in vol. XII of his Werke, is discussed by J. Boniface in chap. V.l of the 
present book [Editors’ note], 

5. Gauss’s Disquisitiones Arithmeticae, art. 335: Tractatio ipsa abunde declarabit, quam 
intimo nexu hoc argumentum cum arithmetica sublimiori coniunctum sit. 
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Gaussian classification of forms, although it appeared artificial from the standpoint 
of the theory of forms, had been demonstrated by Gauss to be more fruitful than the 
obvious classification. Unfortunately, Kummer gave no detailed explanation, and he 
never returned to the subject of ideal prime factors of numbers x + y\JT) and binary 
quadratic forms. 

A great obstacle for modern students of Gauss’s theory of composition of forms 
(arts. 234-25 1) is Gauss’s use of the word “composition” to denote an operation that is 
not a binary operation. Modern treatments normally ignore the composition of forms 
altogether and deal only with the composition of equivalence classes of forms, which 
A a binary operation. Even Andre Weil, [Weil 1984], p. 334, says the Gaussian theory 
was a “stumbling-block” until Dirichlet “restored its simplicity,” without noting that 
Dirichlet only composed forms that satisfy certain additional conditions (conditions 
of “concordance”). Dirichlet in fact made no attempt to compose forms, as Gauss 
had done, but instead focussed on the question of determining which numbers were 
represented by which forms; in this study, it is natural to replace a form by an 
equivalent form whenever it is convenient to do so, and that is what Dirichlet did. In 
other words, he did not compose the two given forms, but instead replaced them, when 
necessary, with equivalent forms in order to find forms that were easy to compose. In 
this way, he solved the problems that interested him and avoided the complications 
of Gauss’s theory, but he left aside the challenging problem Gauss had successfully 
solved, the problem of composing arbitrarily given forms. (See [Dirichlet 1851] or 
§146 of [Dirichlet-Dedekind 1879].) 

Composition of forms is an elaboration of the ancient formula 

(. x 2 — Dy 2 ){u 2 — Dv 2 ) — (xu + Dvy) 2 — D(xv + yu ) 2 , (0) 


where D is a specified integer. Given three binary quadratic forms /, </>, and F (in 
the ancient example, all three are the form X 2 — DY 2 ), a transformation formula is 
a formula f(x, y)<f>(u, v ) = F(X, Y ) where X and Y are linear functions 



a li 
021 


an 

022 



_ 1 

~ XU ~ 

an 

014 


XV 

023 

024 _ 


yu 



_ yv _ 


of the four monomials xu, xv, yu and yv. (Gauss did not, of course, write the trans- 
formation equations in the matrix form used here.) A transformation is a composition 
(art. 235) if (1) the six 2 x 2 minors of the matrix [o,y ] have greatest common divisor 
1 and (2) the first two minors, that is, 011022 — 021012 and 011023 — 021013 , are both 
positive. In the example (0), the transformation is given by the matrix 


1 0 0 D 
0 110 


so it is a composition because the six minors are 1, 1, 0, 0 ,—D and —D. Actually, 
Gauss only imposes the technical condition (2) at a later stage in the development of 
the theory (art. 239). 
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— 341 — 

App + 2 Bpq + Cqq = aa' [1] 

Ap'p' + 2 Bp'q' -f Cq'q' = ac' [2] 

Ap“p“ + 2 Bp"q" -f Cq "q" = ca' [3] 

Ap‘"p"‘ + 2,Bp"‘q'" + Cq"'q‘“ = cc‘ [4] 

App' + B (pq' + qp') + Cqq • = ab‘ [5] 

App " + B (pq“ + qp") + Cqq " — ba' ... [6] 

Ap'p •" + B ( p'q -f q'p '" ) + Cq'q •" = be' [7] 

Ap"p‘" + B ( p"q‘" + q"p<") + Cq"q>" == cb‘ [8] 

A (pp'" + p'p") + B ( pq '" + qp " 1 -f- p'q" 

+ W) + c (99" J + 9'9 ") = 2bb> ■ • M 

Sint determinantes formarum F, f, f' resp. D, 
d, d‘; diuisores communes maximi numerorum 
A, 2 B, C-, a, 2 b, c; a 7 , 2b 1 , c' resp. M , m, m' 

(quos omnes positiue acceptos supponimus). 

Porro determinentur sex numeri integri 21 , 93 , £, 

21 ', Sb', (E' ita vt sit 21 a -f 2 Sbb + (Ec = m, 

21 'a ' + 2%>‘b‘ + S 'c' = m'. Denique designen- 
tur numeri pq' — qp ' , pq" — 9 P" ■, pq — 9 P u, t 
p'q" — q'p", p'q'" — q'P'", p"q"‘ — q“p"‘ resp. 
per P, Q, R, S, T, U, sitque ipsorum diuisor 
communis maxim us positiue acceptus = k. — 
lam ponendo 

App'" + B ( pq '" + qp 1 ") + Cqq'" = bb 1 + A [to] 
fit ex aequ. 9 

Ap'p" 4- B (p'q " + q'p") + Cq'q" = bb' — A [l l] 

Ex his vndecim aequatiombus 1 ... 1 1 , se- 
quentes nouas euoluimus*): 

*) Origo htrum aequationum hue eit : 1 2 « 5 . g I. 2 i 

13 «* 5- 9 — 1. 7 ■— 2. 6 i 14 « IO. II — 6 . 75 IS 
«x 5. 8 + 5. 8 + 10. 10 + 11. ii — i. 4 — 2. 3 — 

* 5 

Fig. II.2A. Computations at the core of Gauss’s composition of forms: 
an extract from art. 235 in the 1801 edition of the Disquisitiones arithmeticae. 

Gauss sets down in equations [l]-[9] the equations that describe a composition 
formula and begins the long solution of the problem. ’’Given two forms, determine 
whether there is a third form that composes them, and, if so, find all such forms.” 
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Note that this definition rests on the firm ground of computation with whole 
numbers. The forms /, cp and F are described by triples of integers, and a com- 
position is described by equations of the form given above. Composition is not a 
binary operation, but a ternary relation. Given / and <p, there may not be any F at 
all for which there is a composition formula, so it is meaningless to talk about ‘‘the 
composite of two given forms.” 6 To make matters worse, if there is an F there are 
certainly infinitely many of them, because X and Y can be subjected to an arbitrary 
unimodular change of variables. 

Gauss says (art. 234) “thus far no one has considered this topic.” Small wonder. 
Ever since Gauss, mathematicians have struggled with it. The first theorem he proves 
states: If / and </> can be composed, the ratio of their determinants must be a ratio of 
squares. The proof is a display of algebraic virtuosity that occupies a few pages. Just 
as demanding and lengthy is his proof of the converse: if the ratio of the determinants 
of two forms is a ratio of squares, a third form can be written as a composite of them. 
His proof of this theorem is of course a construction; given two forms, and given that 
the ratio of their determinants is equal to a ratio of squares, his proof (art. 236) is an 
algorithm for constructing a third form and a 2 x 4 matrix that fulfills the conditions 
that define compositions. (In fact, in his masterly fashion, he shows how to construct 
all possible compositions of them.) 

This is lengthy and daunting, but Gauss has only begun to do what he needs to do. 
Earlier (art. 158) he has defined what it means for two forms to be equivalent. In the 
main this definition is the natural one - each form can be obtained from the other by 
a change of variables with integer coefficients - but, as Gauss was the first to realize, 
addition of the seemingly unnatural requirement for the determinant of the change 
of variables to be positive improves the theory. He next proves that compositions of 
equivalent forms are equivalent. More precisely, if a form F can be expressed as a 
composite of two forms / and /', and if f" is a form equivalent to /', then there 
is a form F' that is a composite of / and /", and any such F' is equivalent to F. 
Note that the second statement implies, when f' — f", that two compositions of the 
same pair of forms are equivalent. 

And more: He needs to prove that this binary operation on equivalence classes 
is associative, a theorem that requires several more pages of work. Because he 
deals with composition of forms rather than of classes of forms, his statement of this 
associative law in art. 240 needs to be rather lengthy: 

If from the forms /, /' the form F is composed, from F, f" the form J) from /, 

/" the form F' and from F', f the form J 7 , then the forms 5 and J 7 are equivalent. 

Finally, after eight long and difficult articles (arts . 234-24 1 ) dealing with compo- 
sition of forms in complete generality. Gauss turns in art. 242 to the specific problem 
of computing composites of given forms. Again I would like to emphasize that he 
composes forms, not equivalence classes. For example, in art. 243 he gives himself 
the problem of finding a form that is a composite of the forms (3, 1 , 47), (4, 0, 35), 

6. Both the English and the German translations of the Disquisitiones wrongly translate the 
theorem of art. 249 when they use definite articles rather than indefinite ones; the original 

Latin of course has no articles. 
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— 573 — 

solus ingreditur, qui est valor expr. 

( mod. h h ). Si e. g. quaeritur forma composita 
ex (16, 5, 19 ) et (8, 1, 57), est h — 2, * =s 
4, a = 5, v ss 3. Hinc A = 8, valor 
expr. | mod. 8), qualis est 1, vnde B = 8k 
— 75, adeoque faciendo & = 9, JJ s=s — 1 
atque C = 57, slue (8, — - l, 57) forma quae- 
sita. 


Propositis itaque formis quotcunque, quarum 
termini initiales omnes sunt potestates numero- 
rum primorum, circumspiciendum erit, num ali- 
quarum termini antecedentes sint potestates eiusdem 
numeri prirni, atque hae inter se respectiue per 
regulam modo traditam componendae. Hac ra« 
tione prodibmit fortnae, quarum termini primi 
etiamnum erunt potestates numerorum primorum, 
Sed omnino diuersOrumj forma itaque ex his 
composita per ohsera. tertiam definiri- poterit. 
E. g. propositis formis (5, 1, 47), (4> «, 55 ), 
(a, o» 28), (16, % 9), (9, 7, ax), (16, 6, n), 
ex prima et quinta conflatur forma (27, 7, 7); 
ex secuucla et quarta confit ( 16, — 6, 11 , ex 
hac et sexta (i, o, 140), quae negligi potest. 
Supersunt itaque (5,0,28), (27, 7, 7), ex qui- 
bus producitur (155, — 20, 4 ) , cuius loco assu- 
mi potest proprie aequiualens (4, o, 35 ). Haec 
itaque est resultans ex compositione sex proposi- 
tarum. 


Ceterum ex hoc fonte plura alia artificia in 
applications vtilia hauriri possuntj sed ne nimis 


Fig. II.2B. A composite of 6 forms: 

an extract from art. 243 in the 1801 edition of the Disquisitiones arithmeticae. 
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(5, 0, 28), (16, 2, 9), (9, 7, 21) and (16, 6, 11), all of which have determinant -140. 
The form he finds is (135, —20, 4). 

He goes on to mention that (135, —20, 4) is equivalent to (4, 0, 35) - a fact that 
follows easily from the presence of 4 in both and the divisibility of the middle terms in 
both by 4 - perhaps because (4, 0, 35) is a simpler representative of the equivalence 
class of the result, and, as was noted above, if F is a composite of / and /', then 
any form equivalent to F is also a composite of / and /'. The result (4, 0, 35) can 
also be found in the following way: as Gauss states, (27, 7, 7) is a composite of the 
first and fifth forms, (3, 1, 47) and (9, 7, 21); and, as is easily found (see below), 
(4, 0, 35) is a composite of the fourth and sixth forms, (16, 2, 9), and (16, 6, 1 1). 
Since (20, 0, 7) is a composite of (4, 0, 35) and (5, 0, 28) - easily found because 
4 and 5 are relatively prime - a composite of all six forms but the second is found 
by using the fact that 27 and 20 are relatively prime to conclude that (540, —20, 1) 
is a composite of (27, 7, 7) and (20, 0, 7) and therefore is a composite of the five 
forms other than (4, 0, 35). This form (540, —20, 1) is equivalent to the principal 
form (1,0, 140), as the last coefficient 1 shows, so any composite of all six forms 
must be equivalent to the composite of (1, 0, 140) and (4, 0, 35) and must therefore 
be equivalent to (4, 0, 35). 

My main point is that computations of this sort are the core of Gauss’s theory of 
composition of forms. Gauss has gone to great lengths to describe in full generality 
the ways in which they may be done and the properties they have. His immediate 
purpose is, as the following sections of the Disquisitiones show, the proof of the law 
of quadratic reciprocity, which he extracts from simple facts about composition of 
primitive equivalence classes of forms for various determinants. 7 This marvellous 
proof leaves the reader with an impression that the theory is a powerful tool that will 
open the way to other realms of arithmetic, as indeed it has. 

2. Revisiting the Composition of Forms 

I would now like to describe a simple method of accomplishing the composition of 
forms that I hope will give some insight into the operations involved and into the 
way in which Gauss’s approach, cumbersome as it is, does place the theory on the 
firm ground of computations with integers in an admirable and rather natural way. 

Let an integer D , not a square, be fixed. I will take the addition and multiplication 
of numbers x + y\Z~D , where x and y are integers, for granted. By a module of 
numbers of the form x + y \Td, where x and y are integers, I will mean a list of 
(a finite number of) such numbers written between square brackets, [x\ + y i \[Y), 
xy + yysfo, ..., x n + y n s/~D]. The term “module” is motivated by the following 
definition: a module is congruent to zero modulo another module, written 

[x\ + y\\T5, ..., x n + y n y/~D] = 0 mod [x[ + y[ VT), ..., x 'm + y'm'f®] 


7. A form is said to be primitive if the coefficients (a , b, c) have no common divisor; if there 
is a primitive form in an equivalence class, all the forms of the class are primitive and the 
class is said to be primitive. 
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if each of its entries is a sum of multiples of entries of the other module in the sense 
that for each i = 1 , 2,..., n there are integers u\, u m , ft, L 2 ,..., v m such that 

m 

Xi + y;VD = E (u„ + v a 4l>)(x' a + y' a V~D). 

<T=1 


Two modules are by definition equal if each is congruent to zero modulo the 
other. As is easily seen, two modules are equal if and only if each can be transformed 
into the other by a sequence of operations of three types: (1) Rearrange terms. (2) 
Annex or delete zeros. (3) Add a multiple of one entry in the module to another entry 
- the multiplier being a number of the form x + y\^D. We can then find a simple, 
uniquely determined representation of a given module: 

Theorem. Let an integer D, not a square, be fixed, and let a list x\ + y\\fD, 
X 2 + >’2 '/O, . . . , x„ + y n \/7) of numbers of the form x + y\Z~D be given. Provided 
at least one of the listed numbers is not zero, there are nonnegative integers e, f and 
g for which ef 0, g < f, g 2 = D mod / and 

[X\ + yi VI), X 2 + T 2 Vt>, . . . , x n + VnV/Z] = [ef, eg + e\/~D]. 

Two modules in this form [ef, eg+g\Z~D], where e, f and g are nonnegative integers, 
ef ^ 0, g < /, and g 2 = D mod /, are equal only if they are identical. 

Proof. Let a module be called full if ~/D times any entry in the module can be 
written as a sum of integer multiples of entries in the module. Every module is equal 
to a full module, as one can prove as follows: Double the length of the module by 
annexing to the end a number of zeros equal to the number of terms in the module. 
To each of the zeros in the second half, add ~J~D times the corresponding term in 
the first half. Then \[D times any term in the second half is equal to D times the 
corresponding term in the first half, so the new module is both full and equal to the 
original one. 

Since every module is equal to a full module, it will suffice to prove that every full 
module that is not equal to [0] is equal to one in the required form [ef, eg + g\Jl)]. 
This will be done in two stages. 

Stage one: Because reversing the sign of an entry in a module obviously gives an 
equal module, one can assume without loss of generality that the coefficient of \[Y) 
in each term listed in the module is nonnegative. Because the module is assumed 
to be full and not equal to [0], at least one entry must contain \J~D with a positive 
coefficient. (Use is made here of the assumption that \[T) is not an integer.) If only 
one entry does, pass to stage two. Otherwise, choose an entry in which the coefficient 
of \[Y) is positive but otherwise as small as possible and subtract this entry from each 
other entry in which the coefficient of \[T) is positive. The new module obtained in 
this way is equal to the old one, and the coefficients of \[D are all nonnegative. The 
new module is also full. (A number x + y\J~D is a sum of integer multiples of the 
entries in the old module if and only if it is a sum of integer multiples of entries in 
the new module, and sTd times an entry in the new module is either \[Y) times an 
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entry in the old module or it is a difference of two such, so it is certainly a sum of 
integer multiples of entries in either the old or the new module.) Such a step reduces 
the total of the coefficients of f~D (by (k — 1) times the smallest of the nonzero 
coefficients, where k is the number of nonzero coefficients), so repetition of the step 
eventually reaches a full module in which all entries but one are integers, and the one 
entry that is not an integer contains f~D with a positive coefficient, at which point 
one passes to stage two. 

Stage two: Given a full module in which all terms but one are integers, one can 
again reverse signs, if necessary, to find an equal full module in which all terms but 
one are nonnegative integers. Delete all zeros from the module. At least one positive 
integer remains, because the module is assumed to be full and not equal to [0] (and 
D is not a square, so (x + y\/T))\Jl) = p (x + y \/l)) is impossible for integer //). 
If only one remains, a module of the form [a, b + fJ)\ that is full and equal to the 
original module has been reached. Otherwise, among the integers in the module 
(now all positive) choose one that is as small as possible, subtract it from each of the 
other integers in the module, and delete all zeros that result. Since each step of this 
type reduces the total of the integers in the module, repetition of it eventually results 
in a full module of the form [a, b + c\/l) | equal to the original module. 

Thus, given any module that is not equal to [0], one can construct a full module 
of the form [a, b + cff)\ that is equal to it. Moreover, one can assume without loss 
of generality that a and c are positive. Because a f~D — p. ■ a + v • (b + cfT)) where 
lx and v are integers, a — v ■ c. Moreover, bf~D + cD — o ■ a + r • (b + cf~D), so 
b — r ■ c and cD — oa + r b = ovc + r 2 c. Thus, with e — c, f — v and g — r, 
the module is [ef, eg + ef~D], where g 2 + of — D, so g 2 = D mod /. Since eg 
can be changed by any multiple of ef, g can be replaced by any integer congruent to 
it mod /, and one can assume without loss of generality that 0 < g < f, in which 
case the module has the required form. 

Suppose now that both [ef, eg + ef~D] and [ [e'f', e' g' +e'f~D[ have the required 
form, and suppose they are equal. Since [ef, eg + ef~D] is full, the statement that 
[e'f, e'g’+e's/D] = 0 mod [ef, eg + eft)] implies e'f' — p -ef + v-(eg + efl5) 
and e'g' + e'f~D — a ■ ef + r ■ (eg + ey/~D). Since v must be zero, e'f' must be 
a multiple of ef. By symmetry, ef must also be a multiple of ef. Since they are 
both positive integers, ef — e'f'. Similarly, since e' = r • e and, by symmetry, e is 
also a multiple of e' and both are positive, e — e’ . Thus, / = /'. Since r must then 
be 1, eg’ = e'g' = a ■ ef + eg, so g' must be congruent to g mod /. Since both are 
nonnegative and less than f , g — g', and the proof is complete. 

A module of this form [ef, eg + f~D] will be said to be in canonical form. 
(The integers e, f and g are nonnegative, ef f 0, g < f, and g 2 = D mod /.) 
The Theorem solves the problem: “Given two modules, determine whether they are 
equal.” Each is equal to one in canonical form, and two in canonical form are equal 
only if they are identical. 

Modules can be multiplied in a natural way: the entries of the product module 
are the products of two factors in which the first factor is from the first module and 
the second factor is from the second. This definition depends, of course, on the fact 
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that it is consistent with the definition of equality of modules, which is to say that 
if one of the two modules is replaced by an equal module, the product module is 
replaced by an equal module. This is easy to prove. 

Thus, every module can be written as a product [ e ] [/. g + \[Y) ] in which the first 
factor is [e] for a positive integer e and the second factor is [/, g + yf~D], where / is a 
positive integer and g is a square root of D mod /. Multiplication of any module by 
[ e ] is easy, so the multiplication of two modules in canonical form, say [e][/, g+VD] 
and [e'][f', g' + \/~D] comes down to the computation of [/, g + s/~D][f r , g' + s/D] 
which is to say the reduction of [//', f(g' + VD), f'{g + Vd), gg' + D + 
(g + g')+fD] to canonical form. This operation contains the essence of the idea 
of the composition of forms. 

For example, Gauss’s statement, mentioned above, that (27, 7, 7) is a composite 
of (3, 1, 47) and (9, 7, 21) follows from 

[3, 1 + V— 140][9, 7 + a/^140] 

= [27, 3(7 + V— 140), 9(1 + V^MO), -133 + SV^MO] 

= [27, 21 + 3V-140, 9 + 9V— 140, 2 + 8V-140] 

= [27, 21 + 3V-140, 7 + V— 140, 2 + 8V-140] 

= [27, 0, 7 + V— 140, -54] = [27, 7 + V-140], 


(These two forms (3, 1, 47) and (9, 7, 21) are concordant in Dedekind’s sense, which 
is to say that the greatest common divisor of a = 3, a — 9 and b + ft = 1+7 is 1. 
Therefore the composite (27, 7, 7) is determined, as Dedekind showed, by the fact 
that B — 1 must be 1 mod 3 and 7 mod 9 and must be a square root of — 140 mod 27.) 

Similarly, the above statement that (4, 0, 35) is a composite of (16, 2, 9) and 
(16, 6, 11) follows from 

[16, 2 + V— 140][16, 6 + V-140] 

= [256, 16(6 + V-140), 16(2 + V-140), -128 + 8 V-140] 

= [8][64, 12 + 2V— 140, 4 + 27^140, -16 + V^MO] 

= [8][64, 8, 4 + 2V-140, -16 + V-140] 

= [8][8, 4 + 2V-140, V-140] = [8][4, V-140], 

(This is a composition of forms that are not concordant in Dedekind’s sense, which 
is to say that the greatest common divisor of a = 16, a — 16, and b + f — 8 is not 
equal to 1. Therefore, Dedekind’s method does not produce a composite.) 

If / and f' are relatively prime, the product [/, g + VD][/', g' + ~J~D] of 
two modules in canonical form with e = 1 is simply [//'. G + VD], where G is 
determined mod ff' by G = g mod / and G = g' mod /'. This is a consequence 
of the fact that there are integers a and r for which of + r f' = 1; since both 



2. Edwards: Composition of Binary Quadratic Forms 


139 


ct/(G + s/D) and xf'(G + s/D) are zero mod [/, G + s/~D)[f' , G + s/~D], so is 
their sum G + \fT), and 

[/, g + S5][f, g' + VD] = [f, g + SD][f, g + Sd] 

= Iff, f(G + >fD), f'(G + VD), (G + Vd) 2 ] 

= [//', /(G + Vd), /'(G + VD), (G + Vz)) 2 , G + VD] 

= [//', g + Vd], 

Explicitly, multiplication of modules can be used to construct composites of 
given forms 8 in the following way (assuming, of course, that the ratio of their deter- 
minants is a ratio of squares): 

Theorem. Let ax 2 + 2bxy + cy 2 and au 2 +2^uv + y v 2 be given forms, and suppose 
that the ratio of their determinants is a ratio of squares, but that the determinants 
themselves are not squares. An explicit composition formula 

(ax 2 + 2b xy + cy 2 )(au 2 + 2 fiuv + y v 2 ) = AX 2 + 2 BXY + CY 2 (1) 

can be constructed as follows. Choose positive integers s and o for which s 2 (b 2 — 
ac ) and a 2 (f 2 — ay) are equal. Let D denote their common value, which is by 
assumption not a square. Put the module [sa, sb + \Z~D][aa, (rfi + \fO \ in canonical 
form, say [so, sb + ~jD)[oa, af + s/~D] = [EHC, G + s/D], Then formula (1) 
holds for 

AX 2 + 2 BXY + CY 2 = ± — (FX 2 + 2 GXY + HY 2 ) (2) 

M 

when the sign is the sign of aa, when H — (G 2 — D)/F, when m , // and M 
are the positive integers dehned by [m] = [a, 2b, c], [/r] = [a, 2fi, y] and [M] — 
[F, 2 G, //] (in short, they are the “contents” of the forms ax 2 + 2 bxy + cy 2 , an 2 + 
2/3uv + yv 2 and FX 2 + 2 GXY + // Y 2 ) and when X and Y are the linear functions 
of xu, xv, yu and yv determined implicitly by 

(sax + (sb + y/~D)y)(oau + (cr/J + y/~D)v) = E(FX + (G + s/~D)Y). (3) 

Proof. Reversing the sign of either of the given forms merely reverses the signs of 
both sides of (1) (the sign of the right side is reversed because the sign of (2) is 
reversed), so there is no loss of generality in assuming that a and a are both positive. 
Neither a nor a can be zero because the determinants are by assumption not squares. 
The definition (3) of A and Y obviously implies saxv+oa.yu + (sb+ofi)yv = EY , 
after which it implies EFX — saaaxu + saofixv + aasbyu + (sbafi + D)yv — 


8. The degenerate case in which the given forms factor over the rationals - which is to say 
that their determinants are squares - will be ignored. 
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G(saxv + aayu + ( sb + aft)yv) so the explicit expression of X and Y in terms of 
xu, xv, yu and yv is 




~ sacra 

sa(ofi—G) 

aa(sb—G ) 

s bcr/3 + D — G (s b+erfi ) 


~ XU 

X 


EF 

EF 

EF 

EF 


XV 

Y 


0 

sa 

E 

aa 

E 

sb+ofi 

E J 


1 

1 


Multiplication of the defining equation (3) by its conjugate ( sax + (sb — */D)y) 
(a au + (a ft — \fD)v) = E(FX + (G — s/D)Y) - which is the same statement as (3) 
- gives ((sax + sby ) 2 — (s 2 b 2 — s 2 ac)y 2 )((aau + aftv) 2 — (<r 2 ji 2 — o 2 cry)v 2 ) = 
E 2 ((FX + GY) 2 - ( G 2 - FH)Y 2 ), that is, 

s 2 a 2 aa(ax 2 + 2 bxy + cy 2 )(au 2 + 2ftuv + yv 2 ) = E 2 F(FX 2 + 2 GXY + HY 2 ). 

9 9 e" 2 172 -r p 2 p p 

Division by so a a gives equation (1) with A — s 2 a 2 ac/ > 2 B — and 

p 2 p pi q p 2 p 

C — , ■, — . Thus, the theorem will be proved if r •f is shown to be equal 

s L o L aa r s L o L aa 1 

to if the entries of the matrix in (4) are shown to be integers and if the greatest 
common divisor of the 2x2 minors of this matrix is shown to be 1. (The first two 
minors and are positive because s, a, a, a and F are all positive.) 

By definition, [so, sb + ~/D][oa, aft + s/D] = [£][F, G + \/Z)]. Thus, E, F 
and G are found by putting [sacra, saaft+sa\Z~D, aasb+aa^ 2 !), sbaft+D+(sb+ 
aft)\fD] in canonical form. Let (p', fp", and *p w (Gauss’s notation) be such that 
sa+ty” aa+ty"’ (sb+aft) is the greatest common divisor of sa, era, and sb+aft, 
call it d. Then d clearly divides both coefficients of all four numbers in the product 
module [sacra, saaft + sa\ 2 T), crash + era \fO, shaft + D + (sb + aft) VD] with 
the possible exception of shaft + D, and it divides this coefficient as well because 
D = (sb) 2 mod sa, so shaft + D = shaft + (sb) 2 = sb(aft + sb) = 0 mod d. 
Let Go be defined by the equation d(Go + s/D) — fp' sa(aft + ~JD) + ip" oa(sb + 
s/D) + (shaft + D + (sb + aft)s/D). In other words, 

G 0 = -(^ saaft + «P ” crash + <P'"( sbcr^ + D)). 
d 

Then 

[sa, sb + \/~D][aa, aft + V~D] 

= [sacra, saaft+sas/~D, aasb+aa^D, sbaft + D+(sb+aft)s/~D, d(Go + s/~D)] 
= [sacra, saaft — saGo, crash — craGo , shaft + D — (sb + a/3)Go, d(Go + Vi))]. 

This module is full because it can be found by starting with the full module [sa, sb + 
«/D](aa, aft + \/T>] (full because each factor is full), annexing a zero, and adding 


9. By the definition of M, the coefficients A, 2 B, C of the composite form are integers. 
For (1) to be a composition in the Gaussian sense, 2 B must be proved to be even, but, 
for reasons explained below, this point will be ignored and (1) will be accepted as a 
“composition" without a proof that 26 is even. 
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integer multiples of other entries to this zero - namely, adding 'ip', <p" and ip'" times 
the appropriate entries. Let Fq be defined by 

[r/Fo] = [sacra, saaft — saG o, crash — oaG o, shaft + D — { sb + aft)G o]. 

Then the module [dFo,dGo + d\TT)\ can be reached by successively subtracting 
integer multiples of entries of [saaa, saaft — saGo , aasb — oaG o, shaft + D — 
{sb + aft)Go, d(Go + s/~D)] to reduce the integers in this module to their greatest 
common divisor dFo while leaving d (Go + sfD) unchanged, so it is full, as well as 
equal to the product module. 

Because [dFo, dGo + ds/D] is full and d and Fq are positive, it is in canonical 
form, except that Go may not be in the range 0 < Go < Fq. Because \d Fq, dGo + 
ds/~D] — [F][F, G + s/~D], it follows that E — d, F — Fq, and G = Go mod F. 

In particular, E is the greatest common divisor of sa, a a and sb + aft, which 
shows that the entries in the second row of the matrix in ( 4 ) are integers. Moreover, 
EF — dFo is the greatest common divisor of saaa, saaft — saG q, aasb — aaG o 
and shaft + D — {sb + aft)Go, from which it follows, because Gq = G mod F, 
that the entries in the first row of the matrix in ( 4 ) are integers. 

The product of [sa, sb + s/~D\ with its conjugate [so, sb — s/~D] is [sa, sb + 
s/~D][sa, sb — s/D] = [(sa) 2 , sa{sb — \/~D), sa{sb + y/~D), s 2 b 2 — s 2 {b 2 — ac )] = 
sb + s/D, sb — s/D, sc] = [sfl][sa, 2 sb, sc, sb + s/~D] = [sa~\[sm, sb + 
\/T)], where m is as in the statement of the theorem. Similar calculations apply to the 
other two modules in the equation [sa, sb+s/D] [era, aft+s/D] = [F][F, G+s/D], 
so multiplication of this equation by its conjugate gives 

[sa][sm, sb + sfD][aa][an, aft + Vd] = [E 2 ][F][M, G + s/d]. 

Since sb = —sb mod sm, aft = —aft mod ap and G = — G mod M, the mod- 
ules in this equation are self-conjugate and multiplication of the equation with its 
conjugate amounts to squaring and results in 

[saaa/[sm][sm, sb + \f~D][a fi][a /x , aft + s/~D ] — [E 2 F] 2 [M][M, G + y/~D] 

which combines with the previous equation to give 

[saaa][smaix][E 2 F][M, G + s/d] = [E 2 F] 2 [M][M, G + Vd], 

When G is reduced mod M (the result must be 0 or \M) the modules on either 
side are in canonical form, and saaa ■ smap — E 2 F ■ M follows. In other words, 

/ = '/j-, as was to be shown. 

s z o z aa M 

Finally, let A ,-y for 0 < ; < j < 4 be the minor of the matrix in ( 4 ) that uses 
columns i and j . It remains to show that the greatest common divisor of the A ( / is 
equal to 1 . By direct computation, 

[F 2 F][Ai2, Ai3, Ai4, A 23, A 24, A34] 

= [Maa][ifl, aa, sb + aft, aft — sb, sc, ay]. 
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(The calculation is simplified when one observes that adding G times the second 
row to the first does not change the minors.) What is to be proved, then, is that 
[ E 2 F ] — [.sacra] [set, era, sb + a/3, ay, sc], which is to say 

[£][saera, sa(a/3 — Go), aa(sb — Go), sba/3 + D — (sb + a/3)Go\ 

— [iflffa][ja, era, sb + a/3, a/3 — sb, ay, sc]. 

The first three terms on the right - that is, saaa times sa, era and sb + a/3 - are 
all divisible by Esaaa and are therefore zero modulo the module on the left. The 
remaining three are zero modulo the module on the left by virtue of 

era sa 

saaa(a/3 — sb) — — • Esa(a/3 — Go) Eaa(sb — Go) 

E E 

sb + aB sa 

saaaay — • Esa(af3 — Go) E(sbaj3 + D — (sb + a/3)Go) 

E E 

sb + aB a a 

saaasc — • Eaa(sb — Go) E(sba/3 + D — (sb + af3)Go)- 

E E 

Finally, the four terms in the module on the left are zero modulo the module on the 
right by virtue of 


Esaaa = (p 's 2 a 2 aa + " saa 2 a 2 + fp m (sb + aji)saaa 

Esa(a/3 — Go) = fp ” saaa(afi — sb) + *P '"saaaay 
Eaa(sb — Go) = ?P ' saaa(sb — af3) + (P '"saaasc 
E(sba/3 + D — (sb + a/3)a/3) = (p 'saaaay + saaasc, 

(the last three equations are obtained by eliminating one of the *Ps from E — 'p's a + 
^"cra + ip w ( sb + a/3) and EGo = *P 'saa/3 + (p "aasb + (p "'(sba/3 + D)) and 
the proof is complete. 

Allowing forms to have odd middle coefficients permits the theorem to take the 
more natural form 

ax 2 + 2bxy + cy 2 au 2 + 2f3uv + yv 2 FX 2 + 2GXY + HY 2 
m /x M 

of a composition of two primitive forms (forms in which the greatest common divisor 
of the coefficients is 1) in which the composite is also primitive. One can obviously 
compose arbitrary forms if one can compose primitive forms, so it is natural to 
restate the theorem in the form: Given two primitive forms ax 2 + bxy + cy 2 and 
an 2 + /3uv + yv 2 , if the ratio of b 2 — 4 ac to f3 2 — 4 ay is a ratio of squares, the 
obvious modification of the construction of the theorem gives a composition formula 
for them in which the composite form is primitive. 

For example, to compose x 2 + xy — y 2 with itself, the theorem replaces it with 
2x 2 + 2 xy — 2y 2 and computes [2, 1 + V5][2, 1 + V5] = [2] [2, 1 + \/5] to find the 
composition (2x 2 + 2 xy — 2y 2 )(2u 2 + 2uv — 2v 2 ) = 2(2X 2 + 2 XY — 2y 2 ), where 
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X and Y are defined by (2x + (1 + V5)y)(2u + (1 + V5)i>) = 2(2X + (1 + V5)T), 
which is to say X — xu + vy and Y = xv + yu + yv. The more natural way to state 
this composition formula is of course 

(x 2 + xy — y 2 )(u 2 + uv — v 2 ) = X 2 + XY — Y 2 . 

Formula (5) can be used to construct a composition of any two forms, when the 
ratio of their determinants is a ratio of squares, whether or not they are primitive 
and whether or not their middle coefficients are even. Once a single composition 
is known, all others are obtained by taking unimodular changes of variables X' — 
pX + i qY ,Y' = rX + sY, where p, cj, r and .v are integers with ps — qr — 1. 

From Gauss’s point of view the theorem does not provide a composition of 
two given forms until B is proved to be an integer, or, in the terms of the modified 
theorem, until it is proved that if the middle coefficients of the given primitive forms 
are both even, the middle coefficient of the composite form given by the theorem is 
even. This statement is true, as follows from Gauss’s construction of art. 236, but 
it is a matter of little importance unless there is a reason to restrict consideration to 
forms with even middle coefficients, a point on which I and many other of Gauss’s 
readers remain unconvinced. 

3. Conclusion 

I hope that the use of module multiplication in some measure simplifies Gauss’s 
theory of composition of forms. For example, it clarifies the difficult associativity 
property described and proved by Gauss in art. 240. Multiplication of modules is 
obviously an associative binary operation, and this property easily translates into the 
property Gauss uses. 10 

But, more importantly, I hope that by focussing attention on Gauss’s composition 
of actual forms - as opposed to equivalence classes of forms as in the modern theory - 
I have highlighted Gauss’s great achievement in giving a rigorous treatment of the 
composition of binary quadratic forms in the greatest possible generality. 

His theory is “rigorous,” not only in the usual sense that it is mathematically 
convincing, but also in the literal sense that it makes great demands on the reader. 
The second kind of rigor has caused succeeding generation of mathematicians to 
devote some of their best efforts to avoiding it. But it is the first kind of rigor 
that makes Gauss the great master. It is based on his mastery of the computational 
structure of his subject and his ability to explain that structure in the most general 
circumstances. While they may seek to avoid the difficulties of Gauss’s theory, 
succeeding generations should never cease to admire it. 


10. Since multiplication of modules can be used to establish the theory of composition of 
forms, Gauss’s proof of quadratic reciprocity using composition of forms can be deduced 
in this way. However, quadratic reciprocity can be proved working directly with mul- 
tiplication of modules. Therefore, if the goal is quadratic reciprocity, one can dispense 
with quadratic forms altogether. Other aspects of Gauss’s theory can be revisited in a 
similar way. See [Edwards 2005], 
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